Modeling and analysis of multi-library, multi-group SAGE data with application to a study of mouse cerebellum.

نویسندگان

  • Zailong Wang
  • Shili Lin
  • Magdalena Popesco
  • Andrej Rotter
چکیده

A serial analysis of gene expression (SAGE) library is a collection of thousands of small DNA "tags," each of which represents a distinct messenger RNA (mRNA) transcript. Existing methods have been proposed for analyzing single library data (i.e., one library per group) or one tag at a time. The practice of lumping all libraries together (in a multi-library setting) to form a "mega" library for each group is obviously unsatisfactory, but nonetheless performed frequently due to the lack of alternative methods. Because the tag counts within each library are interrelated as they are drawn from a multinomial distribution, analyzing thousands of tags one at a time is undoubtedly inadequate. Not only does such a practice ignore the dependency, but it also faces the multiple testing adjustment issue. This article is an attempt to address both of these issues so that all tags from multi-library groups can be analyzed together. The methods proposed also gear toward multi-group data. Focusing on the problem of identifying genes that are differentially expressed, a Bayesian formulation is established. Under this formulation, the problem of separating the differentially expressed genes from the majority of similarly expressed ones is treated as a model selection problem, and the reversible jump Markov chain Monte Carlo method is adapted for this purpose. The method is applied to a set of mouse libraries to uncover genes that are associated with the process of aging in the cerebellum. Our gene ontology (GO) analysis of the genes selected classifies them into several GO categories, which appear to be functionally relevant to aging.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Modeling and Analysis of SAGE Libraries

A Serial Analysis of Gene Expression (SAGE) library is a collection of thousands of small DNA “tags”, each of which represents a distinct mRNA transcript. Existing methods have been proposed for analyzing single library data (i.e., one library per group) or one tag at a time. The practice of lumping all libraries together (in a multi-library setting) to form a “mega” library for each group is o...

متن کامل

Flood Forecasting Using Artificial Neural Networks: an Application of Multi-Model Data Fusion technique

Floods are among the natural disasters that cause human hardship and economic loss. Establishing a viable flood forecasting and warning system for communities at risk can mitigate these adverse effects. However, establishing an accurate flood forecasting system is still challenging due to the lack of knowledge about the effective variables in forecasting. The present study has indicated that th...

متن کامل

A Multi Linear Discriminant Analysis Method Using a Subtraction Criteria

Linear dimension reduction has been used in different application such as image processing and pattern recognition. All these data folds the original data to vectors and project them to an small dimensions. But in some applications such we may face with data that are not vectors such as image data. Folding the multidimensional data to vectors causes curse of dimensionality and mixed the differe...

متن کامل

Modeling and Hybrid Pareto Optimization of Cyclone Separators Using Group Method of Data Handling (GMDH) and Particle Swarm Optimization (PSO)

In present study, a three-step multi-objective optimization algorithm of cyclone separators is catered for the design objectives. First, the pressure drop (Dp) and collection efficiency (h) in a set of cyclone separators are numerically evaluated. Secondly, two meta models based on the evolved Group Method of Data Handling (GMDH) type neural networks are regarded to model the Dp and h as the re...

متن کامل

Evaluating the Efficiency of Firms with Negative Data in Multi-Period Systems: An Application to Bank ‎Data

Data Envelopment Analysis (DEA) is a mathematical technique to evaluate the performance of firms with multiple inputs and outputs. In conventional DEA models, the efficiency scores of Decision Making Units (DMUs) with non-negative inputs and outputs are evaluated in a special period of time. However, in the real world there are situations wherein performance of firms must be evaluated in multip...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Biometrics

دوره 63 3  شماره 

صفحات  -

تاریخ انتشار 2007